App Serial # 09/930.871 Exhibit F 

Turner etaL LEX-0216-USA 
Polynucleotides Encoding Human Ion Channel Proteins (As Amended) 



>AF225985 ACCESSION: AF22 5985 NID: gi 12642269 gb AF225985.1 AF225985 
Homo sapiens voltage-gated sodium channel alpha subunit 
SCN1A (SCN1A) rnRNA, complete cds 
Length = 8131 

Score = 3987 bits (10226) , Expect = 0.0 

Identities = 1990/2010 (99%), Positives = 1990/2010 (99%), Gaps =1/2010 (0%) 
Frame = +1 

Query: 1 MEQTVLVPPGPDSFNFFTRESLAAIERRIAEEKAKNPKPDKKDDDENGPKPNSDLEAGKN 60 

MEQTVLVPPGPDSFNFFTRESLAAIERRIAEEKAKNPKPDKKDDDENGPKPNSDLEAGKN J 
Sbjct: 19 MEQTVLVPPGPDSFNFFTRESLAAIERRIAEEKAKNPKPDKKDDDENGPKPNSDLEAGKN 198 

Query: 61 LPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVLNKGKAIFRFSATSALYILTPFNPLRKI 12 0 

LPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVLNKGKAIFRFSATSALYILTPFNPLRKI 
Sbjct: 199 LPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVLNKGKAIFRFSATSALYILTPFNPLRKI 378 

Query: 121 AIKILVHSLFSMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKIIARGFCL 180 

AIKILVHSLFSMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKIIARGFCL 
Sbjct: 379 AIKILVHSLFSMLIMCTILTNCVFMTMSNPPDWTKNVEYTFTGIYTFESLIKIIARGFCL 558 

Query: 181 EDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGAL 240 

EDFTFLRDPWNWLDFTVITFAWTEFVDLGWSALRTFRVLRALKTISVIPGLKTIVGAL 
Sbjct: 559 EDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTFRVLRALKTISVIPGLKTIVGAL 738 
* ■ 

Query: 241 IQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCIQWPPTNASLEEHSIEKNITVNY 300 

IQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCIQWPPTNASLEEHSIEKNITVNY 
Sbjct: 739 IQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNKCIQWPPTNASLEEHSIEKNITVNY 918. 

Query: 301 NGTLINEWFEFDWKSYIQDSRYHYFLEGFLDALLCGNSSDAGQCPEGYMCVKAGRNPNY 360 

NGTLINETVFEFDWKSYIQDSRYHYFLEGFLDALLCGNSSDAGQCPEGYMCVKAGRNPNY 
Sbjct: 919 NGTLINEWFEFDWKSYIQDSRYHYFLEGFLDALLCGNSSDAGQCPEGYMCVKAGRNPNY 1098 

Query: 361 GYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILA 420 

GYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILA 
Sbjct: 1099 GYTSFDTFSWAFLSLFRLMTQDFWENLYQLTLRAAGKTYMIFFVLVIFLGSFYLINLILA 1278 

Query: 421 WAMAYEEQNQATLEEAEQKEAEFQQMIEQLKKQQEAAQQAATATASEHSREPSAAGRLS 480 

WAMAYEEQNQATLEEAEQKEAEFQQMIEQLKKQQEAAQQAATATASEHSREPSAAGRLS 
Sbjct: 1279 WAMAYEEQNQATLEEAEQKEAEFQQMIEQLKKQQEAAQQAATATASEHSREPSAAGRLS 1458 

Query: 481 DSSSEASKLSSKSAKERRNRRKKRKQKEQSGGEEKDEDEFQKSESEDSIRRKGFRFSIEG 540 

DSSSEASKLSSKSAKERRNRRKKRKQKEQSGGEEKDEDEFQKSESEDSIRRKGFRFSIEG 
Sbjct : 1459. DSSSEASKLSSKSAKERRNRRKKRKQKEQSGGEEKDEDEFQKSESEDSIRRKGFRFSIEG 1638 

Query 541 NRLTYEKRYSSPHQSLLSIRGSLFSPRRNSRTSLFSFRGRAKDVGSENDFADDEHSTFED 600 

NRLTYEKRYSSPHQSLLSIRGSLFSPRRNSRTSLFSFRGRAKDVGSENDFADDEHSTFED 
Sbjct: 1639 NRLTYEKRYSSPHQSLLSIRGSLFSPRRNSRTSLFSFRGRAKDVGSENDFADDEHSTFED 1818 

Query: 601 NESRRDSLFVPRRHGERRNSNLSQTSRSSRMLAVFPANGKMHSTVDCNGWSLVGGPSVP 660 

ireSRRDSLFVPRRHGERRNSNLSQTSRSSRMLAVFPANGKMHSTVDCNGVVSLVGGPSVP 
Sbjct: 1819 NESRRDSLFVPRRHGERRNSNLSQTSRSSRMLAVFPANGKMHSTVDCNGWSLVGGPSVP 1998 

Query: 661 TSPVGQLLPEVIIDKPATDDNGTTTETEMRKRRSSSFHVSMDFLEDPSQRQRAMSIASIL 720 

TSPVGQLLP GTTTETEMRKRRS S SFHVSMDFLED PSQRQRAMS I AS IL 

Sbjct: 1999 TSPVGQLLP GGTTT ETEMRKRRS S S FHVSMDFLED PS QRQ RAMS I AS I L 2145 



Query: 721 TNTVEELEESRQKCPPCOTKFSNIFLIWDCSPYWLKVKHVVNL\AmDPFVDLAITICIVL 780 

TNTVEELEESRQKCPPCWYKFSNIF IWDCSPYWLKVKHWNLWMDPFVDLAITICIVL 
Sbjct: 2146 TNWEELEESRQKCPPCWYKFSNIFSIWDCSPYWLKVKHVVNLVVMDPFVDLAITICIVL 2325 

Query: 781 NTLFMAMEHYPMTDHFNNVLWGNLVFTGIFTAEMFLKIIAMDPYYYFQEGWNIFDGFIV 840 

NTLFMAMEHYPMTDHFNWLTVGNLVFTGIFTAEMFLKI IAMDPYYYFQEGWNIFDGFIV 
Sbjct: 2326 NTLFMAMEHYPMTDHFNNVLWGNLVFTGIFTAEMFLKIIAM^ 2505 

Query: 841 TL SLVELGLANVEGL S VLRSFRLLRVFKLAKSWPTLNML I KI I GNS VGALGNLTLVLAI I 900 

TLSLVELGLAWEGLSVLRSFRLLRVFKLAKSWPTLNMLIKIIGNSVGALGNLTLVLAII 
Sbjct: 2506 TLSLVELGLANVEGLS VLRSFRLLRVFKLAKSWPTLNML I KIIGNS VGALGNLTLVLAI I 2685 

Query: 901 VFIFAWGMQLFGKSYKDCVCKIASDCQLP-RWHMNDFFHSFLIVFRVLCGEWIETMWDC 959 

VFIFAWGMQLFGKSYKDCVCKIASDCQLP RWHMNDFFHSFLIVFRVLCGEWI ETMWDC 
Sbjct: 2686 VFIFAWGMQLFGKSYKDCVCKIASDCQLPQRTOMNDFFHSFLIVFRVLCGEWIETMWDC 2865 

Query: 960 MEVAGQAMCLTVFMMVMVIGNLWLN^^ 1019 

MEVAGQAMCLTVFMMVMVIGNLWLNLFLALL S S F S ADNLAATDDDNEMNNLQ I AVDRM 
Sbjct: 2866 MEVAGQAMCLTVFMMVKVIGNLVVLNLFLALLL S S F S ADNLAATDDDNEMNNLQ I AVDRM 3045 

Query: 1020 HKGVAYVKRKIYEFIQQSFIRKQKILDEIKPLDDLNNKKDSCMSNHTXEIGKDLDYLKDV 1079 

HKGVAYVKRKIYEFIQQSFIRKQKILDEIKPLDDLNNKKDSCMSNHT EIGKDLDYLKDV 
Sbjct: 3046 HKGVAYVKRKIYEFIQQSFIRKQKILDEIKPLDDLNNKKDSCMSNHTTEIGKDLDYLKDV 3225 

, Query: 1080 NGTTSGIGTGSSVEKYIIDESDYMSFINNPSLTVTVPIAVGESDFENLNTEDFSSESDLE 1139 
NGTTSGIGTGSSVEKYIIDESDYMSFINNPSLTVTVPIAVGESDFENLNTEDFSSESDLE 
Sbjct: 3226 NGTTSGIGTGSSVEKYIIDESDYMSFINNPSLTVTVPIAVGESDFENLNTEDFSSESDLE 3405 

Query: 1140 ESKEKLNESSSSSEGSTVDIGAPVEEQPWEPEETLEPEACFTEGCVQRFKCCQINVEEG 1199 

ESKEKLNESSSSSEGSTV PVEEQPWEPEETLEPEACFTEGCVQRFKCCQINVEEG 
Sbjct: 3406 ESKEKLNESSSSSEGSTVGHRRPVEEQPWEPEETLEPEACFTEGCVQRFKCCQINVEEG 3585 

Query: 1200 RGKQWWNLRRTCFRIVEHNWFETFIVFMILLSSGALAFEDIYIDQRKTIKTMLEYADKVF 1259 

RGKQWWNLRRTCFRIVEHNWFETFIVFMILLSSGALAFEDIYIDQRKTIKTMLEYADKVF 
Sbjct: 3586 RGKQWWNLRRTCFRIVEHNWFETFIVFMILLSSGALAFEDIYIDQRKTIKTMLEYADKVF 3765 

Query: 1260 T YI F I LEMLLKWVAYGYQT YFTNAWCWLDFL I VDVSLVSLTANALG YS ELGAI KSLRTLR 1319 

TYIFILEMLLKWVAYGYQTYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAIKSLRTLR 
Sbjct: 3766 TYIFILEMLLKWVAYGYQTYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAI KSLRTLR 3945 

Query: 1320 ALRPLRALSRFEGMRVWNALLGAIPSIMNVLLVCLIFWLIFSIMGVNLFAGKFYHCINT 1379 

ALRPLRALSRFEGMRVVVNALLGAI PS IMNVLLVCLI FWLIFS IMGVNLFAGKFYHC INT 
Sbjct: 3946 ALRPLRALSRFEGMRVVVNALLGAI PS IMNVLLVCLIFWLIFS IMGVNLFAGKFYHC INT 4125 

Query: 1380 TTGDRFDIEDVNNHTDCLKLIERNETARWKN^ 1439 

TTGDRFDIEDVNNHTDCLKLIERNETARWKNVKVNFDNVGFGYLSLLQVATFKGWMDIMY 
Sbjct: 4126 TTGDRFDIEDVNNHTDCLKLIERNETARWKNVTCWFDNVGFGYLSLLQVATFKGWMDIW 4305 

Query: 1440 AAVDSRNVELQPKYEESLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIF 1499 

AAVDSRNVELQPKYEESLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIF 
Sbjct: 4306 AAVDSRNVELQPKYEESLYMYLYFVIFIIFGSFFTLNLFIGVIIDNFNQQKKKFGGQDIF 4485 

Query: 1500 MTEEQKKYYNAMKKLGSKKPQKPIPRPGNKFQGMVFDFVTRQVFDISIMILICLNMVTMM 1559 

MTEEQKKYYNAMKKLGSKKPQKPIPRPGNKFQGMVFDFVTRQVFDISIMILICLNMVTMM 
Sbjct: 4486 MTEEQKKYYNAMKKLGSKKPQKP I PRPGNKFQGMVFDFVTRQVFDI S IMIL ICLNMVTMM 4665 



Query: 1560 VETDDQSEWTTILSRINLVFIVLFTGECVLKLISLRHYYFTIGWNIFDFVWILSIVGM 1619 

VETDDQSEYVTTILSRINLVFIVLFTGECVLKLISLRHYYFTIGWNIFDFVWILSIVGM 
Sbjct: 4666 VETDDQSEYVTTILSRINLVFIVLFTGECVLKLISLRHYYFTIGWNIFDFVWILSIVGM 4845 

Query: 1620 FLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFL 1679 

FLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFL 
.Sbjct: 4846 FLAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLFL 5025 

Query: 1680 VMFIYAIFGMSNFAWKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPP 1739 

VMFIYAIFGMSNFAYVKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPP 
Sbjct: 5026 VMFIYAIFGMSNFAWKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNSKPP 5205 

Query: 1740 DCDPNKVNPGSSVKGDCGNPSVGIFFFVSYIIISFLVWNMYIAVILENFSVATEESAEP 1799 

DCDPNKVNPGSSVKGDCGNPSVGIFFFVSYIIISFLVWNMYIAVILENFSVATEESAEP 
Sbjct: 5206 DCDPNKVNPGSSVKGDCGNPSVGIFFFVSYIIISFLVWNMYIAVILENFSVATEESAEP 5385 

Query: 1800 LSEDDFEMFYEVWEKFDPDATQFMEFEKLSQFAAALEPPLNLPQPNKLQLIAMDLPMVSG 1859 

LSEDDFEMFYEWEKFDPDATQFMEFEKLSQFAAALEPPLNLPQPNKLQLIAMDLPMVSG 
Sbjct: 5386 LSEDDFEMFYEWEKFDPDATQFMEFEKLSQFAAALEPPLNLPQPNKLQLIAMDLPMVSG 5565 

Query: 1860 DRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYQPITTTLKRKQEEVSA 1919 

DRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYQPITTTLKRKQEEVSA 
Sbjct: 5566 DRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYQPITTTLKRKQEEVSA 5745 

n Query: 1920 VIIQRAYRRHLLKRTVKQASFTYNKNKIKGGANLLIKEDMIIDRINENSITEKTDLTMST 1979 
VI I QRAYRRHLLKRTVKQAS FT YNKNKI KGGANLL I KEDMI I DRINENS ITEKTDLTMST 
Sbjct: 5746 VI I QRAYRRHLLKRTVKQAS FT YNKNKI KGGANLL I KEDMI I DRINENS I TEKTDLTMST 5925 

Query: 1980 AAC P P S YDRVTKP I VEKH EQEGKDE KAKGK 2009 

AAC PPS YDRVTKP I VEKHEQEGKDEKAKGK 
Sbjct: 5926 AAC PPS YDRVTKP I VEKHEQEGKDEKAKGK 6015 
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AF225985 8131 bp mRNA linear PRI 01-FEB-2001 

Homo sapiens voltage-gated sodium channel alpha subunit SCN1A 
(SCN1A) mRNA, complete cds . 
AF225985 

AF225985.1 GI:12642269 

Homo sapiens (human) 
Homo sapiens 

Eukaryota; Metazoa; Chordata; Craniata; Vertebrata; Euteleostomi; 
Mammalia; Eutheria; Primates; Catarrhini; Hominidae; Homo. 

1 (bases 1 to 8131) 

Jeong,S.-Y., Goto, J. and Kanazawa,I. 

Cloning of cDNA for human voltage-gated sodium channel alpha 

subunit, SCNlA 

Unpublished 

2 (bases 1 to 8131) 
Jeong,S.-Y., Goto, J. and Kanazawa/I. 
Direct Submission 

Submitted (14-JAN-2000) Department of Neurology, University of 
Tokyo, Graduate School of Medicine, 7-3-1 Hongo, Bunkyo-ku, Tokyo 
113-8655, Japan 

Location/Qualifiers 

1. .8131 

/organism="Homo sapiens" 

/mo l_type= " mRNA " 

/ db_xref = " taxon : 9 6 0 6 " 

/chromosome= " 2 " 

/map="2q24" 

1. .8131 

/gene= M SCNlA" 

19. .6018 

/gene=" SCNlA" 

/ codon_s t ar t = 1 

/product^ "voltage-gated sodium channel alpha subunit 
SCNlA" 

/protein_id= " AAK00217 . 1 " 
/db_xref="GI: 12642270" 

/ trans la tion=" MEQTVLVPPGPDSFNFFTRESLAAIERRIAEEKAKNPKPDKKDD 
DENGPKPNSDLEAGKNLPFIYGDIPPEMVSEPLEDLDPYYINKKTFIVLNKGKAIFRF 
SATSALYILTPFNPLRKIAIKILVHSLFSMLIMCTILTNCVFMTMSNPPDWTKNVEYT 
FTGIYTFESLIKIIARGFCLEDFTFLRDPWNWLDFTVITFAYVTEFVDLGNVSALRTF 
RVLRALKTISVIPGLKTIVGALIQSVKKLSDVMILTVFCLSVFALIGLQLFMGNLRNK 
C IQWPPTNASLEEHS I EKNI TVNYNGTLINETVFEFDWKS YIQDSRYHYFLEGFLDAL 
LCGNSSDAGQCPEGYMCVKAGRNPNYGYTSFDTFSWAFLSLFRLMTQDFWENLYQLTL 
RAAGKTYMIFFVLVIFLGSFYLINLILAWAMAYEEQNQATLEEAEQKEAEFQQMIEQ 
LKKQQEAAQQAATATASEHSREPSAAGRLSDSSSEASKLSSKSAKERRNRRKKRKQKE 
QSGGEEKDEDEFQKSESEDSIRRKGFRFSIEGNRLTYEKRYSSPHQSLLSIRGSLFSP 



http://www.ncbi. nlm.nih.gov/entrez/query.fcgi?cmd=Retrieve&db=nucleotide&list - uids=1264^ 11/11/2003 
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RRNSRTSLFSFRGRAKDVGSENDFADDEHSTFEDNESRRDSLFVPRRHGERRNSNLSQ 
TSRSSRMLAVFPANGKMHSTVDCNGWSLVGGPSVPTSPVGQLLPGGTTTETEMRKRR 
SSSFHVSMDFLEDPSQRQRAMSIASILTNTVEELEESRQKCPPCWYKFSNIFSIWDCS 
PYWLKVKHWNLVVMDPFVDLAITICIVLNT^ 

IFTAEMFLKIIAMDPYYYFQEGWNIFDGFIVTLSLVELGLANVEGLSVLRSFRLLRVF 
KLAKSWPTLNMLIKIIGNSVGALGNLTLVLAIIVFIFAWGMQLFGKSYKDCVCKIAS 
DCQLPQRWHMNDFFHSFLIVFRVLCGEWIETMWDCMEVAGQAMCLTVFMMVMVIGNLV 
VLNLFLALLLS S F SADNLAATDDDNEMNNLQI AVDRMHKGVAYVKRKI YEF IQQSFI R 
KQKILDEIKPLDDLNNKKDSCMSNHTTEIGKDLDYLKDVNGTTSGIGTGSSVEKYIID 
ESDYMSFINNPSLTVTVPIAVGESDFENLNTEDFSSESDLEESKEKLNESSSSSEGST 
VGHRRPVEEQPWEPEETLEPEACFTEGCVQRFKCCQIWEEGRGKQWWNLRRTCFRI 
VEHNWFETFIVFMILLSSGALAFEDIYIDQRKTIKTMLEYADKVFTYIFILEMLLKWV 
AYGYQTYFTNAWCWLDFLIVDVSLVSLTANALGYSELGAIKSLRTLRALRPLRALSRF 
EGMRVWNALLGAI PS IMNVLLVCLI FWLI FS IMGVNLFAGKF YHC INTTTGDRFDI E 
DVNNHTDCLKLIERNETARWKNVKVOT 

VELQPKYEESLYMYLYFVIF 1 1 FGSFFTLNLF IGVI I DNFNQQKKKFGGQDI FMTEEQ 
KKYYNAMKKLGSKKPQKPIPRPGNKFQGMVFDFVTRQVFDISIMILICLNMVTMMVET 
DDQSEYVTTILSRINLVFIVLFTGECVLKLISLRHYYFTIGWNIFDFVWILSIVGMF 
LAELIEKYFVSPTLFRVIRLARIGRILRLIKGAKGIRTLLFALMMSLPALFNIGLLLF 
LVMFIYAIFGMSNFAYVKREVGIDDMFNFETFGNSMICLFQITTSAGWDGLLAPILNS 
KPPDCDPNKVNPGSSVKGDCGNPSVGIFFFVSYIIISFLVWNMYIAVILENFSVATE 
ESAEPLSEDDFEMFYEWEKFDPDATQFMEFEKLSQFAAALEPPLNLPQPNKLQLIAM 
DLPMVSGDRIHCLDILFAFTKRVLGESGEMDALRIQMEERFMASNPSKVSYQPITTTL 
KRKQEEVSAVI IQRAYRRHLLKRTVKQASFTYNKNKIKGGANLLIKEDMI IDRINENS 
ITEKTDLTMSTAACPPSYDRVTKPIVEKHEQEGKDEKAKGK" 

ORIGIN 

* • 1 aatgtgcagg atgacaagat ggagcaaaca gtgcttgtac caccaggacc tgacagcttc 

61 aacttcttca ccagagaatc tcttgcggct attgaaagac gcattgcaga agaaaaggca 
121 . aagaatccca aaccagacaa aaaagatgac gacgaaaatg gcccaaagcc aaatagtgac 
181 ttggaagctg gaaagaacct tccatttatt tatggagaca ttcctccaga gatggtgtca 
241 gagcccctgg aggacctgga cccctactat atcaataaga aaacttttat agtattgaat 
301 aaagggaagg ccatcttccg gttcagtgcc acctctgccc tgtacatttt aactcccttc 
361 aatcctctta ggaaaatagc tattaagatt ttggtacatt cattattcag catgctaatt 
421 atgtgcacta ttttgacaaa ctgtgtgttt atgacaatga gtaaccctcc tgattggaca 
481 aagaatgtag aatacacctt cacaggaata tatacttttg aatcacttat aaaaattatt 
541 gcaaggggat tctgtttaga agattttact ttccttcggg atccatggaa ctggctcgat 
601 ttcactgtca ttacatttgc gtacgtcaca gagtttgtgg acctgggcaa tgtctcggca 
661 ttgagaacat tcagagttct ccgagcattg aagacgattt cagtcattcc aggcctgaaa 
721 accattgtgg gagccctgat ccagtctgtg aagaagctct cagatgtaat gatcctgact 
781 gtgttctgtc tgagcgtatt tgctctaatt gggctgcagc tgttcatggg caacctgagg 
841 aataaatgta tacaatggcc tcccaccaat gcttccttgg aggaacatag tatagaaaag 
901 aatataactg tgaattataa tggtacactt ataaatgaaa ctgtctttga gtttgactgg 
961 aagtcatata ttcaagattc aagatatcat tatttcctgg agggtttttt agatgcacta 
1021 ctatgtggaa atagctctga tgcaggccaa tgtccagagg gatatatgtg tgtgaaagct 
1081 ggtagaaatc ccaattatgg ctacacaagc tttgatacct tcagttgggc ttttctgtcc 
1141 ttgtttcgac taatgactca ggacttctgg gaaaatcttt atcaactgac attacgtgct 
1201 gctgggaaaa cgtacatgat attttttgtg ttggtcattt tcttgggctc attctaccta 
1261 ataaatttga tcctggctgt ggtggccatg gcctacgagg aacagaatca ggccaccttg 
1321 gaagaagcag aacagaaaga ggccgaattt cagcagatga ttgaacagct taaaaagcaa 
1381 caggaggcag ctcagcaggc agcaacggca actgcctcag aacattccag agagcccagt 
1441 gcagcaggca ggctctcaga cagctcatct gaagcctcta agttgagttc caagagtgct 
1501 aaggaaagaa gaaatcggag gaagaaaaga aaacagaaag agcagtctgg tggggaagag 
1561 aaagatgagg atgaattcca aaaatctgaa tctgaggaca gcatcaggag gaaaggtttt 
1621 cgcttctcca ttgaagggaa ccgattgaca tatgaaaaga ggtactcctc cccacaccag 
1681 tctttgttga gcatccgtgg ctccctattt tcaccaaggc gaaatagcag aacaagcctt 
1741 ttcagcttta gagggcgtgc aaaggatgtg ggatctgaga acgacttcgc agatgatgag 
1801 cacagcacct ttgaggataa cgagagccgt agagattcct tgtttgtgcc ccgacgacac 
1861 ggagagagac gcaacagcaa cctgagtcag accagtaggt catcccggat gctggcagtg 
1921 tttccagcga atgggaagat gcacagcact gtggattgca atggtgtggt ttccttggtt 
1981 ggtggacctt cagttcctac atcgcctgtt ggacagcttc tgccaggggg aacaaccact 



http://www.ncbi.nlm.nih.gov/entrez/query .fcgi?cmd==Retrieve&db=nucleotide&list_uids=126... 11/11/2003 
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2041 gaaactgaaa tgagaaagag aaggtcaagt tctttccacg tttccatgga ctttctagaa 
2101 gatccttccc aaaggcaacg agcaatgagt atagccagca ttctaacaaa tacagtagaa 
2161 gaacttgaag aatccaggca gaaatgccca ccctgttggt ataaattttc caacatattc 
2221 tcaatctggg actgttctcc atattggtta aaagtgaaac atgttgtcaa cctggtcgtg 
2281 atggacccat ttgttgacct ggccatcacc atctgtattg tcttaaatac tcttttcatg 
2341 gccatggagc actatccaat gacggaccat ttcaataatg tgcttacagt aggaaacttg 
2401 gttttcactg ggatctttac agcagaaatg tttctgaaaa ttattgccat ggatccttac 
2461 tattatttcc aagaaggctg gaatatcttt gacggtttta ttgtgacgct tagcctggta 
2521 gaacttggac tcgccaatgt ggaaggatta tctgttctcc gttcatttcg attgctgcga 
2581 gttttcaagt tggcaaaatc ttggccaacg ttaaatatgc taataaagat catcggcaat 
2641 tccgtggggg ctctgggaaa tttaaccctc gtcttggcca tcatcgtctt catttttgcc 
2701 gtggtcggca tgcagctctt tggtaaaagc tacaaagatt gtgtctgcaa gatcgccagt 
2761 gattgtcaac tcccacaacg ctggcacatg aatgacttct tccactcctt cctgattgtg 
2821 ttccgcgtgc tgtgtgggga gtggatagag accatgtggg actgtatgga ggttgctggt 
2881 caagccatgt gccttactgt cttcatgatg gtcatggtga ttggaaacct agtggtcctg 
2941 aatctctttc tggccttgct tctgagctca tttagtgcag acaaccttgc agccactgat 
3 001 gatgataatg aaatgaataa tctccaaatt gctgtggata ggatgcacaa aggagtagct 
3061 tatgtgaaaa gaaaaatata tgaatttatt caacagtcct tcattaggaa acaaaagatt 
3121 ttagatgaaa ttaaaccact tgatgatcta aacaacaaga aagacagttg tatgtccaat 
3181 catacaacag aaattgggaa agatcttgac tatcttaaag atgtaaatgg aactacaagt 
3241 ggtataggaa ctggcagcag tgttgaaaaa tacattattg atgaaagtga ttacatgtca 
3301 ttcataaaca accccagtct tactgtgact gtaccaattg ctgtaggaga atctgacttt 
3361 gaaaatttaa acacggaaga ctttagtagt gaatcggatc tggaagaaag caaagagaaa 
3421 ctgaatgaaa gcagtagctc atcagaaggt agcactgtgg gacatcggcg ccctgtagaa 
3481 gaacagcccg tagtggaacc tgaagaaact cttgaaccag aagcttgttt cactgaaggc 
3541 tgtgtacaaa gattcaagtg ttgtcaaatc aatgtggaag aaggcagagg aaaacaatgg 
3 601 tggaacctga gaaggacgtg tttccgaata gttgaacata actggtttga gaccttcatt 
3661 gttttcatga ttctccttag tagtggtgct ctggcatttg aagatatata tattgatcag 
3721 cgaaagacga ttaagacgat gttggaatat gctgacaagg ttttcactta cattttcatt 
3781 ctggaaatgc ttctaaaatg ggtggcatat ggctatcaaa catatttcac caatgcctgg 
3841 tgttggctgg acttcttaat tgttgatgtt tcattggtca gtttaacagc aaatgccttg 
3901 ggttactcag aacttggagc catcaaatct ctcaggacac taagagctct gagacctcta 
3961 agagccttat ctcgatttga agggatgagg gtggttgtga atgccctttt aggagcaatt 
4021 ccatccatca tgaatgtgct tctggtttgt cttatattct ggctaatttt cagcatcatg 
4081 ggcgtaaatt tgtttgctgg caaattctac cactgtatta acaccacaac tggtgacagg 
4141 tttgacatcg aagacgtgaa taatcatact gattgcctaa aactaataga aagaaatgag 
4201 actgctcgat ggaaaaatgt gaaagtaaac tttgataatg taggatttgg gtatctctct 
4261 ttgcttcaag ttgccacatt caaaggatgg atggatataa tgtatgcagc agttgattcc 
4321 agaaatgtgg aactccagcc taagtatgaa gaaagtctgt acatgtatct ttactttgtt 
4381 attttcatca tctttgggtc cttcttcacc ttgaacctgt ttattggtgt catcatagat 
4441 aatttcaacc agcagaaaaa gaagtttgga ggtcaagaca tctttatgac agaagaacag 
4501 aagaaatact ataatgcaat gaaaaaatta ggatcgaaaa aaccgcaaaa gcctatacct 
4561 cgaccaggaa acaaatttca aggaatggtc tttgacttcg taaccagaca agtttttgac 
4621 ataagcatca tgattctcat ctgtcttaac atggtcacaa tgatggtgga aacagatgac 
4681 cagagtgaat atgtgactac cattttgtca cgcatcaatc tggtgttcat tgtgctattt 
4741 actggagagt gtgtactgaa actcatctct ctacgccatt attattttac cattggatgg 
4801 aatatttttg attttgtggt tgtcattctc tccattgtag gtatgtttct tgccgagctg 
4861 atagaaaagt atttcgtgtc ccctaccctg ttccgagtga tccgtcttgc taggattggc 
4921 cgaatcctac gtctgatcaa aggagcaaag gggatccgca cgctgctctt tgctttgatg 
4981 atgtcccttc ctgcgttgtt taacatcggc ctcctactct tcctagtcat gttcatctac 
5041 gccatctttg ggatgtccaa ctttgcctat gttaagaggg aagttgggat cgatgacatg 
5101 ttcaactttg agacctttgg caacagcatg atctgcctat tccaaattac aacctctgct 
5161 ggctgggatg gattgctagc acccattctc aacagtaagc cacccgactg tgaccctaat 
5221 aaagttaacc ctggaagctc agttaaggga gactgtggga acccatctgt tggaattttc 
5281 ttttttgtca gttacatcat catatccttc ctggttgtgg tgaacatgta catcgcggtc 
5341 atcctggaga acttcagtgt tgctactgaa gaaagtgcag agcctctgag tgaggatgac 
5401 tttgagatgt tctatgaggt ttgggagaag tttgatcccg atgcaactca gttcatggaa 
5461 tttgaaaaat tatctcagtt tgcagctgcg cttgaaccgc ctctcaatct gccacaacca 
5521 aacaaactcc agctcattgc catggatttg cccatggtga gtggtgaccg gatccactgt 
5581 cttgatatct tatttgcttt tacaaagcgg gttctaggag agagtggaga gatggatgct 
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5641 ctacgaatac agatggaaga gcgattcatg gcttccaatc cttccaaggt ctcctatcag S 

5701 ccaatcacta ctactttaaa acgaaaacaa gaggaagtat ctgctgtcat tattcagcgt 

5761 gcttacagac gccacctttt aaagcgaact gtaaaacaag cttcctttac gtacaataaa 

5821 aacaaaatca aaggtggggc taatcttctt ataaaagaag acatgataat tgacagaata 

5881 aatgaaaact ctattacaga aaaaactgat ctgaccatgt ccactgcagc ttgtccacct 

5941 tcctatgacc gggtgacaaa gccaattgtg gaaaaacatg agcaagaagg caaagatgaa 

6001 aaagccaaag ggaaataaat gaaaataaat aaaaataatt gggtgacaaa ttgtttacag 

6061 cctgtgaagg tgatgtattt ttatcaacag gactccttta ggaggtcaat gccaaactga 

6121 ctgtttttac acaaatctcc ttaaggtcag tgcctacaat aagacagtga ccccttgtca 

6181 gcaaactgtg actctgtgta aaggggagat gaccttgaca ggagattact gttctcacta 

6241 ccagctgaca ctgctgaaga taagatgcac aatggctagt cagactgtag ggaccagttt 

6301 caaggggtgc aaacctgtga ttttggggtt gtttaacatg aaacacttta gtgtagtaat 

6361 tgtatccact gtttgcattt caactgccac atttgtcaca tttttatgga atctgttagt 

6421 ggattcatct ttttgttaat ccatgtgttt attatatgtg actatttttg taaacgaagt 

6481 ttctgttgag aaataggcna aggacctcta taacangtat gccacctggg gggtanggca 

6541 accacatggc nctcccagct acacaaagtc gtggtttgca tgagggcatg ctgcacttag 

6601 agatcatgca tgagaaaaag tcacaagaaa aacaaattct taaatttcac catatttctg 

6661 ggaggggtaa ttgggngata agtggaggtg ctttgttgat cttgttttgc gaaatccagc 

6721 ccctanacca agtagattgt ttgtgggtag gncagtaaat cttagcaggt gcaaacttca 

6781 ttcaaatgtt tggagtcata aatgttatgt ttctttttgt tgtattaaaa aaaaacctga 

6841 atagtgaata ttgcccctca ccctccaccg ccagaagact gaattgacca aaattactct 

6901 ttataaattt ctgctttttc ctgcactttg tttagccatc ttcggctctc agcaaggttg 

6961 acactgtata tgttaatgaa atgctattta ttatgtaaat agtcatttta ccctgtggtg 

7021 cacgtttgag caaacaaata acgacctaag cacagtattt attgcatcaa atatgtacca 

7081 caagaaatgt agagtgcaag ctttacacag gtaataaaat gtattctgta ccatttatag 

7141 atagtttgga tgctatcaat gcatgtttat attaccatgc tgctgtatct ggtttctctc 

7201 actgctcaga atctcattta tgagaaacca tatgtcagtg gtaaagtcaa ggaaattgtt 

7261 caacagatct catttattta agtcattaag caatagtttg cagcacttta acagcttttt 

7321 ggttattttt acattttaag tggataacat aggtatatag ccagactgta cagacatgtt 

7381 taaaaaaaca cactgcttaa cctattaaat atgtgtttag aattttataa gcaaatataa 

7441 atactgtaaa aagtcacttt attttatttt tcagcattat gtacataaat atgaagagga 

7501 aattatcttc aggttgatat cacaatcact tttcttactt tctgtccata gtactttttc 

7561 atgaaagaaa tttgctaaat aagacatgaa aacaagactg ggtagttgta gatttctgct 

7 621 ttttaaatta catttgctaa ttttagatta tttcacaatt ttaaggagca aaataggttc 

7681 acgattcata tccaaattat gctttgcaat tggaaaaggg tttaaaattt tatttatatt 

7741 tctggtagta cctgtactaa ctgaattgaa ggtagtgctt atgttatttt tgttcttttt 

7801 ttctgacttc ggtttatgtt ttcatttctt tggagtaatg ctgctctaga ttgttctaaa 

7861 tagaatgtgg gcttcataat ttttttttcc acaaaaacag agtagtcaac ttatatagtc 

7921 aattacatca ggacattttg tgtttcttac agaagcaaac cataggctcc tcttttcctt 

7981 aaaactactt agataaactg tattcgtgaa ctgcatgctg gaaaatgcta ctattatgct 

8041 aaataatgct aaccaacatt taaaatgtgc aaaactaata aagattacat tttttattcg 

8101 aaaaaaggaa aaaaaaaaaa aaaaaaaaaa a 
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